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Abstract 

Granulocyte-colony stimulating factor (G-CSF) is used worldwide to prevent neutropenia caused by high- 
dose chemotherapy. It has limited stability, strict formulation and storage requirements, and because of poor 
oral absorption must be administered by injection (typically daily). Thus, there is significant interest in 
developing analogs with improved pharmacological properties. We used our ultrahigh throughput compu- 
tational screening method to improve the physicochemical characteristics of G-CSF. Improving these 
properties can make a molecule more robust, enhance its shelf life, or make it more amenable to alternate 
delivery systems and formulations. It can also affect clinically important features such as pharmacokinetics. 
Residues in the buried core were selected for optimization to minimize changes to the surface, thereby 
maintaining the active site and limiting the designed protein's potential for antigenicity. Using a structure 
that was homology modeled from bovine G-CSF, core designs of 25-34 residues were completed, corre- 
sponding to 10 2, -I0 28 sequences screened. The optimal sequence from each design was selected for 
biophysical characterization and experimental testing; each had 10-14 mutations. The designed proteins 
showed enhanced thermal stabilities of up to 13°C, displayed five- to 10-fold improvements in shelf life, and 
were biologically active in cell proliferation assays and in a neutropenic mouse model. Pharmacokinetic 
studies in monkeys showed that subcutaneous injection of the designed analogs results in greater systemic 
exposure, probably attributable to improved absorption from the subcutaneous compartment. These results 
show (hat our computational method can be used to develop improved pharmaceuticals and illustrate its 
utility as a powerful protein design tool. 
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Many techniques have been used in the design of new and 
improved proteins. In vitro directed evolution methods such 
as phage display, DNA shuffling, and error-prone PCR are 
widely used. Rational design approaches continue to be ap- 
plied, and strategies that combine both are now being used. 
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Successful designs include enzymes (Chen and Arnold 
1991; Stemmer 1994; Zhao et al. 1998) and other proteins 
(Crameri et al. 1996), as well as therapeutically useful pro- 
teins such as hormones and cytokines (Lowman and Wells 
1993; Heikoop et al. 1997; Grossmann et al. 1998; Chang et 
al. 1999). The experimental techniques involve the genera- 
tion and screening of libraries of random protein sequences. 
However, the number of sequences that can be screened ex- 
perimentally is limited (about I0' 4 for library panning and 10 7 
for high Ihroughput screening). Libraries of this size allow for 
the simultaneous modification of only about 10 residues. 
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positions, and the conformations of the replaced side chains 
were optimized using PDA. Most of the replaced residues 
were solvent exposed, thereby introducing little strain into 
the structure and allowing typical PDA parameters to be 
used for conformation optimization. One substitution, how- 
ever, was at a buried site, G167V, and clashed stericaljy 
with a nearby disulfide bond. To accommodate the larger 
Val, the side-chain conformation at this position was opti- 
mized using a less restrictive van der Waals scale factor (0.6 
instead of 0.9). The entire structure was then briefly mini- 
mized to relax the strain. The final structure that served as 
the template for all the designs is shown in Figure 1 . 

Core designs 

Unlike many experimental sequence screening methods, 
PDA allows control over which residues are allowed to 
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Fig. 1. Template structure of hG-CSF used for Protein Design Automation 
(PDA) designs. The human sequence was homology modeled onto the 
bovine crystal structure (PDB record Ibgc). The residues that differ in the 
bovine sequence or were not present in the bovine crystal structure were 
replaced with the residues from the human sequence. The conformations of 
the replaced side chains were optimized using PDA (the larger Val at 
posiiion 167 was optimized using a less restrictive van der Waals scale 
factor), and ihe entire structure was energy minimized for 50 steps. 



change. Core residues were selected because optimization 
of these positions can improve stability yet minimize 
changes to the molecular surface, thus limiting the designed 
protein's potential for antigenicity. Ala scanning studies of 
G-CSF indicate one or two binding sites on the protein 
surface that are probably responsible for granulopoietic ac- 
tivity (Reidhaar-Olson et al. 1996; Young et ah 1997) (Fig. 
I). Although recent crystallographic studies of G-CSF com- 
plexed to its receptor show only one binding site in a novel 
2:2 complex (Horan et al. 1996; Aritomi et al. 1999), both 
sites were avoided in the core designs to ensure preservation 
of function. 

Two PDA design calculations were run: a deep core de- 
sign that included residues deeply buried in the interior of 
the protein and an expanded core design (exp_core) that 
also included less buried peripheral core residues. The deep 
core design had 26 core positions that were allowed to vary 
(shown yellow and gold in Fig. 2), whereas exp_core had 34 
(shown yellow and turquoise in Fig. 2). Only hydrophobic 
amino acids were considered at the variable core positions. 
These included Ala, Val, lie, Leu. Phe, Tyr, and Trp. Gly 
was also allowed for the variable positions that had Gly in 
the bovine wild-type structure (positions 28, 149, 150, and 
167). Met and Pro were not allowed. 

Optimal sequences 

The optimal sequences selected by PDA are also shown in 
Figure 2. The optimal sequence from the deep core design 
had 10 mutations (named core 10), and the optimal exp_core 
sequence had 11 (named exp_corell); thus, 33%-38% of 
the variable residues changed their identities. Eight of the 
mutated positions changed to the same amino acid in both 
designs. Changing the set of design positions can signifi- 
cantly impact the amino acid selected at a given position. 
For example, in the deep core design, Leu89 retains the 
same amino-acid identity and conformation as wild type. 
However, in the explore design, when Leu92 is also al- 
lowed to vary, both positions (Leu89 and Leu92) mutate to 
Phe, indicating a coupling between these two core residues. 
The modeled structure of the sequence selected in the deep 
core design (core 10) is shown in Figure 3. 

Native human G-CSF (met hG-CSF) and the optimal se- 
quence from each of the core designs were cloned, ex- 
pressed in Escherichia coli y and purified for experimental 
studies. 

Thermal stability 

The far-ultraviolet (UV) circular dichroism (CD) spectra for 
met hG-CSF and the designed proteins were nearly identical 
to each other and to published spectra for met hG-CSF 
(Reidhaar-Olson et al. 1996; Young et al. 1997), indicating 
highly similar secondary structure and tertiary folds (data 
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Fig. 3. Modeled structure of hG-CSF analog (core 10) oblained from deep 
core design. Twenly-six core residues were allowed to vary; computational 
screening with PDA resulted in 10 mutations: C17L, G28A, L78F, Y85F 
LI03V, VIIOJ, Fl I3L, V151I, VI531. and L168F. 

To determine the importance of the other mutations, an- 
other sequence was made (core2) that contained only two of 
the core 10 mutations, G28A and C17A; all other residues 




' ' ■ ■ ' ' 

30 40 50 60 70 80 90 100 110 
Temperature f*C) ' 



Fig. 4. Thermal stability of hG-CSF analogs. Thermal stability was as- 
sessed by monitoring the temperature dependence of the circular dichroism 
spectral signal at 222 nm. Melting temperatures (T m s) were derived from 
the derivative curve of the ellipticity at 222 nm versus temperature. Core 10 
and core2 showed increases in T m of B^C and 5°C, respectively, over 
native met hG-CSF. 



were identical to wild type (Fig. 2). The T m of core2 was 
5°C higher than wild type, indicating that improvements in 
helical propensity and the elimination of a free cysteine arc 
important for heightened thermostability. The remainder of 
the increase in T m seen for corelO may be attributable to 
improved packing interactions and increased hydrophobic 
burial. 



Storage stability 

Increased shelf life is important for distribution and storage 
and is a desirable feature for G-CSF and other protein drugs. 
Because aggregation and chemical degradation are the pre- 
dominant mechanisms of inactivatiort of G-CSF (Herman et 
al. 1996), shelf life was estimated by incubating the proteins 
at elevated temperature and then using size-exclusion chro- 
matography to observe the disappearance of monomeric 
protein. Chemical degradation was estimated using reverse 
phase chromatography (data not shown). Core2 and core 10 
showed five- and 10-fold improvements in storage stability, 
respectively, at 50°C (Fig. 5). Rate constants were deter- 
mined by a first order exponential fit of the fraction mono- 
mer remaining/time curves using KaleidaGraph (Synergy 
Software). 



piological activity 

Granulopoietic activity was determined in vitro by quanti- 
sing cell proliferation as a function of protein concentra- 
tion in murine lymphoid cells transfected with the gene for 
the human G-CSF receptor. The designed proteins were as 
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Fig. 5. Shelf life of hG-CSF analogs. Shelf life was estimated by incubat- 
ing the proteins at elevated temperature (50°C) and using size exclusion 
chromatography to observe disappearance of monomeric protein. Rate con- 
stants were determined by a first order exponential fit of the fraction 
monomer remaining/time curves. Core2 and core 10 showed five- and 10- 
fold improvements in storage stability, respectively, over met hG-CSF 
controls. 
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Comparison to published G CSF variants 

In vitro and cassette mutagenesis studies have shown that 
alterations of the N -terminal region of G-CSF can lead to 
improved granulopoietic activity (Kuga et al. 1 989; Okabe 
et al. 1990). Point mutations at Cysl7 have also been found 
to affect shelf life; replacement with Ala led to an increase, 
Ser had no effect, and large residues (He, Tyr, Arg) led to a 
decrease (Ishikawa et al. 1992). In contrast, our core 10 se- 
quence, which has a large residue (Leu) at this position, 
showed an improved shelf life. This may be explained by 
the observation that in a Cysl7Leu point mutant, Leu's side 
chain would clash with the aromatic ring of the nearby Phe 
at position 1 13. This steric clash does not occur in core 10, 
however, because the Phe at 1 13 is replaced by Leu and, in 
compensation for this change, two nearby Leu's become 
Phe's (at positions 78 and 168). Thus, multiple mutations 
allow complementary repacking of the hydrophobic core in 
the corel 0 mutant and may be responsible for its enhanced 
stability and shelf life. 

Significant improvements in thermal stability were also 
observed when the seven helical Gly residues in G-CSF 
were replaced with Ala to form point, double, and triple 
mutants (Bishop et al. 2001). Substitutions at positions 26, 
28, 149, and 150 were the most effective. The investigators 
attributed the stabilizing effect to the enhancement in ot-he- 
lical propensity associated with the Gly/Ala substitutions. 
These data support our suggestion that the heightened ther- 
mal stability seen with our mutants (which also contain a 
Gly/Ala substitution at position 28) is at least in part attrib- 
utable to an improvement in helical propensity. 

Probing the robustness of PDA with 
a homology modeled core position 

As pointed out previously, the homology modeling of hu- 
man G-CSF onto the bovine structure was straightforward 
for the most part because the replaced residues were prima- 
rily solvent exposed and no rearrangement of the backbone 
was necessary. The change at one core position, however, 
G167V, induced a steric clash and energy minimization of 
the entire protein was used to relieve the strain. We decided 
to assess the impact of this manipulation by , doing an addi- 
tional design (core 167V) in which the variable residues 
were essentially the same as in the deep core design except 
that position 167 was also allowed to vary. We found that 
Vail 67 mutated to Ala (the other mutations were essentially 
the same as for corel 0). To probe the plasticity of the core, 
instead of using this PDA optimal sequence, which only had 
two mutations in this region, we ran experiments on another 
high-scoring sequence (core!4_V167A) that had additional 
mutations (14 total, including LI 571, F160W, and L161F). 
This sequence was chosen because it balanced an extensive 
number of mutations with a relatively high design score. 



Although it ranked 21st in the sequence energy list and was 
2 kcal/mole less favorable than the optimal sequence, it was 
still biologically active and as stable as wild type (T m of 
61°C) (Figs. 2, 4). This indicates that optimization with 
PDA is fairly robust, and that the protein core can be quite 
plastic and can accommodate large changes without sacri- 
ficing stability or function. 

Conclusions 

PDA is a powerful ultrahigh throughput computational 
screening method. Its ability to screen up to 10 80 sequences 
and allow multiple simultaneous mutations significantly in- 
creases the likelihood of finding new and improved pro- 
teins. In this study, PDA was used to develop improved 
analogs for a therapeutically important protein, hG-CSF. 
The novel proteins showed enhanced thermal stabilities and 
shelf life while retaining biological activity. Analysis of the 
mutants and results obtained with derived sequences indi- 
cates that the heightened stability is attributable to improve- 
ments in helical propensity and the elimination of a free 
cysteine; improved core packing and optimized hydropho- 
bic burial of side chains may also be important. Pharmaco- 
kinetic studies indicate that subcutaneous injection of the 
most stable variant results in greater systemic exposure, 
probably attributable to improved absorption from the sub- 
cutaneous compartment. 

These results show that PDA can be successfully applied 
to proteins of therapeutic interest. They also illustrate the 
value of its precise control over the site and type of muta- 
tions, allowing for the rational design of desired properties 
such as improved stability and pharmacokinetics and the 
elimination of undesirable ones such as toxicity and antige- 
nicity. These features are particularly important in the de- 
sign of therapeutic proteins. PDA thus has great potential as 
a powerful in silico tool for therapeutic protein design. 

Materials and methods 

Template structure preparation 

The template structure fpr the designed proteins was produced by 
homology modeling using the crystal structure of bovine G-CSF 
(Brookhaven Protein Data Bank code Ibgc) as the starting point. 
The program BIOGRAF (Molecular Simulations Inc., San Diego, 
CA) was used to generate explicit hydrogens on the structure, 
which was then minimized for 50 steps using the conjugate gra- 
dient method and the Dreiding II force field (Mayo et al. 1990). 
The residues that differ in the bovine sequence or were not present 
in the bovine crystal structure were replaced with the human resi- 
dues for those positions. The conformations of die replaced side 
chains were optimized using PDA (Dahiyat and Mayo 1997a,b), 
and the entire structure was minimized again for 50 steps. This 
minimized structure was used as the template for all the designs. 
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stored (-70°C). Plasma concentrations were determined using an ' 
enzyme-linked immunosorbent assay (Quantikine human G-CSF 
EUSA, R&D Systems, Minneapolis, MN), performed per manu- 
facturers instructions except that samples were diluted in PBS, 5% 
nonfat dry milk, and 0.05% Tween 20, and the incubation was 
extended to overnight at 4°C Plasma concentrations of the de- 
signed hG-CSF analog and filgrastim were estimated from their 
corresponding standard curves. Pharmacokinetic parameters were 
calculated by noncompartmental analysis. The terminal slope (kz) 
was estimated by linear regression through the last time points of 
the log concentration versus time curves and used to calculate the 
terminal half-life (t 1/a ). The area under the curve from time of 
dosing through the last time point (AUC a J was calculated by the 
linear trapezoid method. 

Acknowledgments 

We thank Dr. Belinda Avalos (Ohio State University) for kindly 
supplying the Ba/F3 cell line transfected with the hG-CSF recep- 
tor. We also thank Dr. Steven Adams (American College of Labo- 
ratory Animal Medicine) and LAB Preclinical Research Institute 
Inc., (Quebec, Canada) for conducting the monkey studies. 

The publication costs of this article were defrayed in part by 
payment of page charges. This article must therefore be hereby 
marked "advertisement" in accordance with 18 USC section 1734 
solely to indicate this fact. 

» 

References 

Aritomi. M., Kunishima, N., Okamoto, T., Kuroki, R., Ola, Y.» and Morikawa. 
K. 1999. Atomic siructure of the GCSF-receptor complex showing a new 
cytokine-reccptor recognition scheme. Nature 401: 713-717. 

Avalos, B.R.. Hunter, M.G- Parker. J.M„ Cesclski, S.K., Druker, BJ_ Corey, SJ., 
and Mehta, V.B. 1995. Point mutations in the conserved box 1 region inactivate 
the human granulocyte colony-stimulating factor receptor for growth signal 
transduction and tyrosine phosphorylation of p75c-rel. Blood 85: 3117-3126. 

Bishop, B., Koay, D.C, Sanorelli. AC, and Regan. L. 2001. Reengineering 
granulocyte colony-stimulating factor for enhanced stability. J. Biol. Chem. 
276: 33465-33470. 

Bolon. D.N. and Mayo, S.L. 2001. Enzyme-like proteins by computational 
design. Proc. Natl Acad. Sci. 98: 14274-14279. 

Chang, C.C., Chen, T.T., Cox. B.W., Dawes : C.N, Stemmer, W.P., Punnonen, 
J„ and Patten. P.A. 1999. Evolution of a cytokine using DNA family shuf- 
fling. Nat. Biotcclmot. 17: 793-797. 

Chen, K.Q. and Arnold, F.H. 1991. Enzyme engineering for nonaqueous sol- 
vents: Random mutagenesis to enhance activity of subtilisin E in polar 
organic media. Biotechnology 9: 1073-1077. 

Crameri, A., Whttehorn, E.A., Tale, E., and Stemmer, W.P. 1996. Improved 
green fluorescent protein by molecular evolution using DNA shuffling. Not. 
Biutechnof. 14: 315-319. 

Dahiyal, B.I. and Mayo, S.L. 1996. Protein design automation. Protein Sci. 5: 
895-903. 

. 1997a. De novo protein design: Fully automated sequence selection. 

Science 278: 82-87. 
. 1997b. Probing ihe role of packing specificity in protein design. Proc. 

Natl. Acad. Sci. 94: 10172-10177. 
Desjarlais, J.R. and Handel. T.M. 1995. De novo design of the hydrophobic 

cores of proteins. Protein Sci 4: 2006-20)8. ' 
Dunbrack, R.L- and Karplus, M. 1993. Backbone-dependent rotamer library for 

proteins— an application to side-chain prediction. / Mol. Biol. 230: 543-574. 
Grossmann, M.. LeitoJf, H. T Wcintraub, B.D., and Srtcudlinski, M.W. 1998. A 

rational design strategy for protein hormone superagonists. Nat. Biotechnol. 

16: 871-875. 

Hatton\ K., Shimizu, K., Takahashi, M., Tamura, M., Oheda, M.. Ohsawa, N., 
and Ono, M. 1990. Quantitative in vivo assay of human granulocyte colony- 
stimulating factor using cyclophospbamide-induced neutropenic mice. 
Blood 75: 1 228-1 233. 

Heikoop, J.C, van den Boogaan, P., Mulders, J.W., and Crootenhuis, P.D. 
1997. Structure-based design and protein engineering of intersubunii disul- 
fide bonds in gonadotropins. Not. Biotedmol. 15: 658-662. 



Hellinga, H.W. and Richards. F.M. 1994. Optimal sequence selection in proteins 
of known structure by simulated evolution. Proc. Natl Acad. ScL 91: 5803- 
5807. 

Herman, A.C., Boone, T.C, and Lu. H.S. 1996. Characterization, formulation, 
and stability of Neupogen (Filgrastim), a recombinant human granulocyle- 
colony stimulating factor. Phami. Biotechnol. 9: 303-328. 

Hill. CP., Osslund, T.D., and Eisenberg, D. 1993. The structure of granulocyte- 
colony-stimulating factor and its relationship to other growth factors. Proc 
Natl Acad. ScL 90: 5 1 67-5 171. 

Home, P.D., Barriocanal, L., and Undholm. A. 1999. Comparative pharmaco- 
kinetics and pharmacodynamics of the novel rapid-acting insulin analogue, 
insulin aspart, in healthy volunteers. Eur. J. Clin. Pharmacol. 55: 199-203. 

Horan, T., Wen, J., Narhi, L. t Parker, V., Garcia, A., Arakawa, T^ and Philo, J. 

1996. Dimerization of the extracellular domain of granuloycte-colony 
stimulating factor receptor by ligand binding: A monovalent ligand induces 
2:2 complexes. Biochemistry 35: 4886-4896. 

Howey, D.C. Bowsher, R.R., Bnmelle, R.L, and Woodworth, J.R. 1994. 
|Lys(B28), Pro(B29)l-human insulin. A rapidly absorbed analogue of hu- 
man insulin. Diabetes 43: 396-402. 

Ishikawa, M., lijima, H„ Saiake-Ishikawa, R., Tsumura, H„ Iwamatsu, A., 
Kadoya. T., Shimada, Y., Fukamachi. H., Kobayashi, K., Matsuki, S., et al. 
1992. The substitution of cysteine 17 of recombinant human G-CSF with 
alanine greatly enhanced its stability. Cell Sirua. Fund. 17: 61-65. 

Jiang, X., Farid, H, Pistor, E., and Farid, R.S. 2000. A new approach to the 
design of uniquely folded thermally stable proteins. Protein Sci. 9: 403-4 16. 

Kolvenbach. C.G., Narhi, L.O., Philo, J.S., Li, T, Zhang, M., and Arakawa. T. 

1997. Granulocyte-colony stimulating factor maintains a thermally stable, 
compact, partially folded structure at pH2. J. Pept. Res. 50: 310-318. 

Kraemer-Pecore. CM., WollacotL A.M., and Desjarlais, J.R. 2001. Computa- 
tional protein design. Curr. Opin. Chem. Biol 5: 690-695. 

Kuga, T., Komatsu, Y., Yamasaki, M., Sckine. S., Miyaji, H., Nishi, T., Sato, 
M., Yokoo, Y., Asano, M., Okabe, M., et al. 1989. Mutagenesis or human 
granulocyte colony stimulating factor. Biochem. Biophys. Res. Commun 
159:103-111. 

Lovejoy, B., Cascio, D, and Eisenberg. D. 1993. Crystal structure of canine and 
bovine granulocyte-colony stimulating factor (G-CSF). J. Mol Biol. 234: 
640-653. 

Lowman. H.B. and Wells, J.A. 1993. Affinity maturation or human growih 
hormone by monovalent phage display. J. Mol. Biol. 234: 564-578. 

Lu. H.S., Clogston, C.L.. Nami, LO., Mercwelhcr, L.A.. Pearl, W.R.. and 
Boone, T.C. 1992. Folding and oxidation of recombinant human granulo- 
cyte colony stimulating factor produced in Escherichia coll Characteriza- 
tion of the disulfide-reduced intermediates and cysteine — serine analogs. J. 
Biol Chem. 267: 8770-8777. 

Malakauskas, S. and Mayo. S. 1998. Design, structure and stability or a hyper- 
ihermophilic protein variant. Nat. Struct. Biol. 5: 470-475. 

Marshall, S.A. and Mayo, S.L. 2001. Achieving stability and conformational 
specificity in designed proteins via binary patterning. J. Mol. Biol 305: 
619-631. 

Mayo, S.L., Olafson, B.D., and Goddard III. W.A. 1990, Dreiding: A generic 
force field for molecular simulations. /. Phys. Chem. 94: 8897-8909. 

Okabe, M., Asano, M., Kuga, T. r Komatsu, Y., Yamasaki, M.. Yokoo, Y.. Itoh, 
S., Morimoto, M.. and Oka, T. 1990. in vitro and in vivo hematopoietic 
effect of mutant human granulocyte colony-stimulating factor. Blood 75: 
1788- J 793. 

Pokala, N. and Handel, T.M. 2001. Review: Protein design— where we were, 
where we are, where we're going. J. Struct. Biol 134: 269-281. 

Reidhaar-OIson, J.F., De Souza-Han. J.A., and Selick, H E. 1996. Identification 
of residues critical to the activity of human granulocyte colony-stimulating 
factor. Biochemistry 35: 9034-9041. 

Shimaoka. M., ShiTman, J.M., Jing, H., Takagi, J., Mayo, S.L., and Springer, 
TJ\. 2000. Computational design of an iniegrin I domain stabilized in the 
open high affinity conformation. Nat. Struct. Biol. 7: 674-678. 

Stemmer. W.P. 1994. Rapid evolution of a protein in vitro by DNA shuffling. 
Nature 370: 389-391. 

Street, A.G. and Mayo, S.L 1999. Computational protein design. Structure 
Fold. Des. 7: R 105-1 09. 

Strop. P. and Mayo, S.L. 1999. Rubredoxin variant folds without iron. / Am. 
Chem. Soc. 121: 2341-2345. 

Young. D.C, Zhan, H., Cheng. QU Hou, J., and Matthews, DJ. 1997. Char- 
acterization of the receptor binding determinants of granulocyte colony 
stimulating factor. Protein Sci. 6: 1228-1236. 

Zhao, H.. Giver. L., Shao. Z, Affholter. J.A., and Arnold, F.H. 1998. Molecular 
evolution by staggered extension process (StEP) in vitro recombination. 
Nat. Biotechnol 16: 258-261. 



1226 Protein Science, vol. 11 



